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Abstract 

The quantum query complexity of Boolean matrix multiplication is typically studied as a 
function of the matrix dimension, n, as well as the number of Is in the output, I. We prove an 
upper bound of 0{n^/i) for all values of £. This is an improvement over previous algorithms 
for all values of t. On the other hand, we show that for any e < 1 and any I < eri^, there is an 
ll(n\/£) lower bound for this problem, showing that our algorithm is essentially tight. 

We first reduce Boolean matrix multiplication to several instances of graph collision. We 
then provide an algorithm that takes advantage of the fact that the underlying graph in all of 
our instances is very dense to find all graph collisions efficiently. 

1 Introduction 

Quantum query complexity has been of fundamental interest since the inception of the field of 
quantum algorithms [BB BV971 IGro96| ISho97| . The quantum query complexity of Boolean matrix 
multiplication was first studied by Buhrman and Spalek [BS06]. In the Boolean matrix multiplica- 
tion problem, we want to multiply two n x n matrices A and B over the Boolean semiring, which 
consists of the set {0, 1} with logical OR (V) as the addition operation and logical and (A) as the 
multiplication operation. 

For this problem it is standard to consider an additional parameter in the complexity: The 
number of Is in the product C := AB, which we denote by i. We study the query complexity as a 
function of both n and i, and obtain improvements for all values of i. 

The problem of Boolean matrix multiplication is of fundamental interest, in part due to its 
relationship to a variety of graph problems, such as the triangle finding problem and the all-pairs 
shortest path problem. 
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Classically, it was shown by Vassilevska Williams and Williams that "practical advances in 
triangle detection would imply practical [Boolean matrix multiplication] algorithms" [VWlOj . The 
previous best quantum algorithm for Boolean matrix multiplication, by Le Gall, is based on a 
subroutine for finding triangles in graphs with a known tripartition [Gall2], already suggesting that 
the relationship between Boolean matrix multiplication and triangle finding might be more complex 
for quantum query complexity. We give further evidence for this by bypassing the triangle finding 
subroutine entirely. 

Despite its fundamental importance, much has remained unknown about the quantum query 
complexity of Boolean matrix multiplication and its relationship with other query problems in the 
quantum regime. Even for the simpler decision problem of Boolean matrix product verification, 
where we are given oracle access to three n x n Boolean matrices, A, B and C, and must decide 
whether or not AB = C, the quantum query complexity is unknown. The best upper bound is 
0(n3/2) [BS06], wher eas the lower bound was recently improved from the trivial 0(n) to 0(n ) 
by Childs, Kimmel, and Kothari |CKK11| . 

A better understanding of these problems may lead to an improved understanding of quantum 
query complexity in general. We contribute to this by closing the gap (up to logarithmic factors) 
between the best known upper and lower bounds for Boolean matrix multiplication for all i < en? 
for any constant e < 1. 



Previous Work. We are interested in the query complexity of Boolean matrix multiplication, 
where we count the number of accesses (or queries) to the input matrices A and B. Buhrman and 
Spalek BS06 Section 6.2] describe how to perform Boolean matrix multiplication using 0{n^^'^\/l) 
queries, by simply quantum searching for a pair {i,j) G [n] x [n] such that there is some k £ [n] 
for which j4[z,A;] = B[k,j] = 1, where [re] = {1, . . . ,re}. By means of a classical reduction relating 
Boolean matrix multiplication and triangle finding, Vassilevska Williams and Williams [VWlOj were 
able to combine the quantum triangle finding algorithm of Magniez, Santha and Szegedy |MSS07| 
with a classical strategy of Lingas |Lin09| to get a quantum algorithm for Boolean matrix multipli- 
cation with query complexity 0(min{re-^-^£-^'''/^°, re^ + re^^/^^£^'^/^°}). 

Recently, Le Gall [Gall2j improved on their work by noticing that the triangle finding needed 
for Boolean matrix multiplication involves a tripartite graph with a known tripartition. He then 
recast the known quantum triangle finding algorithm of |MSS07| for this special case and improved 
the query complexity of Boolean matrix multiplication. He then further improved the algorithm 
for large £ by adapting the strategy of Lingas to the quantum setting. 



Our Contributions. Since previous quantum algorithms for Boolean matrix multiplication are 
based on a triangle finding subroutine, a natural question to ask is whether triangle finding is a 
bottleneck for this problem. We show that this is not the case by bypassing the triangle finding 
problem completely to obtain a nearly tight result for Boolean matrix multiplication. 

A key ingredient of the best known quantum algorithm for triangle finding is an efficient al- 
gorithm for the graph collision problem. Our main contribution is to build an algorithm directly 
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on graph collision instead, bypassing the use of a triangle finding algorithm. Surprisingly, we do 
not use the graph collision algorithm that is used as a subroutine in the best known quantum 
algorithm for triangle finding. That algorithm is based on Ambainis' quantum walk for the element 
distinctness problem |Amb04] . Our algorithm, on the other hand, does not have any quantum 
walks. 

There are two main ideas we would like to impress upon the reader. First, we can reduce the 
Boolean matrix multiplication problem to several instances of the graph collision problem. Second, 
the instances of graph collision that arise depend on t, in particular, they have at most ^ non-edges. 
Furthermore, we need to find all graph collisions, not just one. We provide an algorithm to find a 
graph collision in query complexity 0(\/£+-y/n), or to find all graph collisions in time 0(\/Z+\/nA), 
where A > 1 is the number of graph collisions. Combining these ideas yields the aforementioned 
upper bound. 

A lower bound of il{n^fl) for all values of ^ < en^ for any constant e < 1 follows from a simple 



reduction to ^-Threshold, which we state formally in Theorem 4.2 



This paper is organized as follows. After presenting some preliminaries in Section [2| we describe 
in Section |3| the graph collision problem, its relationship to Boolean matrix multiplication, and a 
subroutine for finding all graph collisions when there are at most ^ non-edges. In Section|4j we apply 
our graph collision subroutine to get the stated upper bound for Boolean matrix multiplication, 
and then describe a tight lower bound that applies to all values of £ < en^ for e < 1. 



2 Preliminaries 

2.1 Quantum Query Framework 

For a more thorough introduction to the quantum query model, see jBBC^Ol] . For Boolean matrix 
multiplication, we assume access to two query operators that act as follows on a Hilbert space 
spanned by : i, j G [n],6 G {0,1}}: 

Oa ■■ \i,j,b) ^ \i,j,b(BA[i,j]) Ob : \i,j,b) ^ \i,j,b(BB[i,j]) 

In the quantum query model, we count the uses of Oa and Ob, and ignore the cost of implementing 
other unitaries which are independent of A and B. We call Oa and Ob the oracles, and each access 
a query. The query complexity of an algorithm is the maximum number of oracle accesses used by 
the algorithm, taken over all inputs. 

We denote a problem P by a map X ^ 2-^ , where P(x) <^y denotes the set of valid outputs on 
input X. We say a quantum algorithm A solves a problem P: ^1" — )• 2-^ with bounded error if 
for all X £ X, Pr[A(x) E > 1 — where is the size of the input. The quantum query 

complexity of P is the minimum query complexity of any quantum algorithm that solves P with 
bounded error 6{\x\) < 1/3. 

We will use the phrase with high probability to mean probability at least 1 — for some super- 
linear polynomial. We ensure that all our subroutines succeed with high probability, to achieve 
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a bounded-error algorithm at the end. To achieve such high probabihty, we wih necessarily incur 
polylog factors. We will use the notation O to indicate that we are suppressing polylog factors. 
More precisely, f{n) G 0{g{n)) means f{n) G 0{g{n)\og^ n) for some constant k. 



Boolean Matrices. We let B denote the Boolean semiring, which is the set {0, 1} under the 
operations V, A. The problem we will be considering is formally defined as the following: 

Boolean Matrix Multiplication 

Oracle Input: Two Boolean matrices A,B & B"^". 

Output: C G B"^" such that C = AB. 

In B"^", we say that C = AB if for ah i,j G [n], C[i,j] = VLi ^[^' ^] ^ B[k,j]. We wiU use 
the notation A + B to denote the entry-wise V of two Boolean matrices. 



2.2 Quantum Search Algorithms 

In this section we examine some well-known variations of the search problem that we require. The 
reader familiar with quantum search algorithms may skip to Section [3j 

Any search problem can be recast as searching for a marked element among a given collection, 
U. In order to formalize this, let / : C/ — )■ {0, 1} be a function whose purpose is to identify marked 
elements. An element is marked if and only if /(x) = 1. Define tj = In Grover's search 

algorithm, the algorithm can directly access /, and the overall complexity can be stated as the 
number of queries to /. In the following t > 1 is an integer parameter. 

Theorem 2.1 ([Gro96]). There is a quantum algorithm, GroverSearch(t), with query complexity 
0{^/\U\Ji) to f, such that, ift/2 < tf < t, then GroverSearch(i) finds a marked element with 
probability at least 1 — l/poly(|C/|). 

Moreover, iftj = 0, then GroverSearch(t) declares with probability 1 that there is no marked element. 

There are several ways to generalize the above statement when no approximation of t is known. 
Most of the generalizations in the literature are stated in terms of expected query complexity, such 
as in |BBHT98] . Nonetheless, one can derive from |BBHT98l Lemma 2] an algorithm in terms of 
worst case complexity, when only a lower bound t on tf is known. The algorithm consists of T 
iterations of one step of the original Grover algorithm where T is chosen uniformly at random from 
[0, -y/|?7|/t ]. This procedure is iterated 0(log \U\) times in order to get bounded error l/poly(|C/|). 



Corollary 2.2. There is a quantum algorithm Search(t) with query complexity 0{^y\U\/t) to f, 
such that, iftf >t, then Search (t) finds a marked element with probability at least 1 — l/poly(|[/|). 
Moreover, iftf = 0, then Search(t) declares with probability 1 that there is no marked element. 



One consequence of Corollary 2.2 is that we can always apply Search (t) with t = 1, when no 
lower bound on is given. In that case, we simply refer to the resulting algorithm as Search. Its 



query complexity to / is then 0{x/\U\). 
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Another simple generalization is for finding all marked elements. This generalization is stated 
in the literature in various ways for expected and worst case complexity. For the sake of clarity we 
explicitly describe one version of this procedure using GroverSearch as a subroutine. This version 
is robust in the sense that it works even when the number of marked elements may decrease 
arbitrarily. This may occur, for example, when the finding of one marked element may cause 
several others to become unmarked. This situation will naturally occur in the context of Boolean 
matrix multiplication. Then the complexity will only depend on the number of elements that are 
actually in the output, as opposed to the number of elements that were marked at the beginning 
of the algorithm. 

SearchAII 

1. Let t = \U\, and V = U 

2. While t > 1 

(a) Apply GroverSearch(t) to V 

(b) If a marked element x is found: Output x; Set V V — {x} and t t — 1 
Else: 

3. If no marked element has been found, declare 'no marked element' 

Corollary 2.3. SearchAII has query complexity 0{y^\U\{tf + 1)) to f, and finds all marked elements 
with probability at least 1 — l/poly(|C/|). 

Moreover, iftj = 0, then SearchAII declares with probability 1 that there is no marked element. 

We end this section with an improvement of GroverSearch when we are looking for an optimal 
solution for some notion of maximization. 

Theorem 2.4 ( jDH96| IDHHM06] ). Given a function g : [/ — t- M, there is a quantum algorithm, 
FindMax((/), with query complexity 0{^y\U\) to f, such that FindMax((7) returns x G such 
that g{x) = max^/gj-i(i) (7(x') with probability at least 1 — l/poly(|C/|). Moreover, if tj = 0, then 
FindMax(5() declares with probability 1 that there is no marked element. 

3 Graph Collision 

In this section we describe the graph collision problem, and its relation to Boolean matrix multi- 
plication. We then describe a method for solving the special case of graph collision in which we are 
interested. 
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3.1 Problem Description 

Graph collision is the following problem. Let G = {A,B,E) be a balanced bipartite graph on 2n 
vertices. We will suppose A = [n] and B = [n], though we note that in the bipartite graph, the 
vertex labelled by i in ^ is distinct from the vertex labelled by i in i3. 

Graph Collision(G) 

Oracle Input: A pair of Boolean functions /a'-A^ {0, 1} and Jb-B^ {0, 1}. 
Output: £ Ax B such that /^(i) = fsU) — 1 ^^"^ ihj) G E, if such a pair exists, 
otherwise reject. 

The graph collision problem was introduced by Magniez, Santha and Szegedy as a subprob- 
lem in triangle finding |MSS07j . The subroutine used to solve an instance of graph collision is 
based on Ambainis' quantum walk algorithm for element distinctness |Amb04j . and has query 
complexity 0(n^/^). The same subroutine is used in the current best triangle finding algorithm of 
Belovs |Belll| . However, the best known lower bound for this problem is ^}{^/n). It is an important 
open problem to close this gap. 

To obtain our upper bound, we do not use the quantum walk algorithm for graph collision, but 
rather, a new algorithm that takes advantage of two special features of our problem. The first is 
that we always know an upper bound, £, on the number of non-edges. When i < n, we can find a 
graph collision in 0{^/n) queries. The second salient feature of our problem is that we need to find 
all graph collisions. 



3.2 Relation to Boolean Matrix Multiplication 

Recall that the Boolean matrix product of A and B, can be viewed as the sum (entry- wise V) of 
n outer products: C = X]fc=i ^['^ ^]^[^' where denotes the k^^ column of A and B[k,-] 

denotes the k^^ row of B. 

For a fixed k, if there exists some i E [n] and some j G [n] such that A[i, k] = 1 and B[k,j] = 1, 
then we know that C[i,j] = 1, and we say that A; is a witness for (i, j). We are interested in finding 
all such pairs For each index k, we could search for all pairs with A[i,A;] = B[k,j] = 1; 

however, this could be very inefficient, since a pair may have up to n witnesses. Instead, we 
will keep a matrix C such that j] = 1 if we have already found a one at position Thus, 
we want to find a pair (i, j) such that A[i, k] = B[k,j] = 1 and C[i, j] = 0. That is, we want to find 
a graph collision in the graph with bi-adjacency matrix C, the entry- wise complement of C, and 

fA = A[;k], fB = B[k,-]. 

This gives the following natural algorithm for Boolean matrix multiplication, whose details and 



full analysis can be found in Section 4.1 



First, let C = 0. _ 

Search for an index k such that the graph collision problem on k with C as the underlying 

graph has a collision. 
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If no such k is found then we are done, and C is the product of A and B. 

Otherwise, find all the graph collisions on the graph defined by C with oracles 
and B[k, ■] and record them in C. 

Eliminate this k from future searches and search for another index k again. 
3.3 Algorithm for Graph Collision 

When G is a complete bipartite graph, then the relation between A and B defined by G is trivial. 
In that case, there is a very simple algorithm to find a graph collision: Search for some i G [n] 
such that = 1- Then search for some j € [n] such that fsij) = 1- Then is a graph 

collision pair. The query complexity of this is 0{^/n + \/n). However, when G is not a complete 
bipartite graph, there is a nontrivial relation between A and B. The best known algorithm solves 
this problem using a quantum walk. 

In our case, we can take advantage of the fact that the graph we are working with always has at 
most i non-edges — it is never more than distance i from the complete bipartite graph, which we 
know is easy to deal with. We are therefore interested in the query complexity of finding a graph 
collision in some graph with m non-edges, which we denote 0£(n,m). In our case, £ will always be 
an upper bound on m. 

For larger values of i, we will also make use of the fact that for some k, we will have multiple 
graph collisions to find. We let (S^gjiinjin, X) denote the query complexity of finding all graph 
collisions in a graph with m non-edges, where A is the number of graph collisions. It is not 
necessary to know A a priori. 

Again we note that if G is a complete bipartite graph, then we can accomplish the task of 
finding all graph collisions using SearchAII to search for all marked elements on each of /a and fs, 
and output /^^(l) x /^^(l). Letting = = so the total number of graph 

collision pairs is A = tAts, the query complexity of this method is 0{^/ntA + \/ntB) € 0(\/nA). So 
if G is close to being a complete bipartite graph, we would like to argue that we can do nearly as 
well. This motivates the following algorithm. 
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AWGCcifAjB) 



Let m denote the number of non-edges in G. Let di be the degree of the i^^ vertex in A, and let 
Ci := n — di. Let the vertices in A be arranged in decreasing order of degree, so that di > d2 > 
. . . > dn. We wiU say a vertex i in A (resp. B) is marked if = 1 (resp. fsii) = !)• 

1. Find the highest degree marked vertex in A using FindMax. Let r denote the index of this 
vertex. 0(-y/n) 

2. Case 1: If Cr < \/rn 



(a) Find ah marked neighbors of r by SearchAII. Output any graph colhsions found. O(vnA) 

(b) Delete all unmarked neighbors of r. Read the values of all non- neighbors of r. 0{^/m) 

(c) Let A' denote the subset of A consisting of all i G ^ with a marked neighbour in B. 
Find all marked vertices in A' by SearchAII. 0(\/nA) 

3. Case 2: If Cr > \/rri 



(a) Delete the first r — 1 vertices in A since they are unmarked. 

(b) Read the values of all remaining vertices in A. 0{y/rn) 

(c) Let B' denote the subset of B consisting of all j £ B with a marked neighbour in A. 
Find all marked vertices in B' by SearchAII. 0(\/nA) 



Theorem 3.1. For all A > 1, (SCall("', rn, A) G 0{VnX + -y/m) and <S(i{n, m) G + ^/m 

Proof. We will analyze the complexity of AIIGCG'(/y 
Step 1 has query c omp lexity O(y^) by Theorem 



j_£r) step by step. 



2.4 



Steps 2a, 2c and 3c have query complexity 



O(vnA) by Corollary 2.3 In Case 1, r has Cr < y/m non-neighbours, so we can certainly query 
them all in step 2b with 0{^/cp) G 0{y/m) queries. 

Consider Case 2, when Cr > \/m. We can ignore the first r — 1 vertices, since they are unmarked. 
Since the remaining n — r + 1 vertices all have Ci > Cr > \/rn, and the total number of non-edges 
is m, we have (n — r + 1) x ^/m <?7i=^(n — r + 1) < y/m. Thus, there are at most ^/m remaining 
vertices and querying them all costs at most 0{^/rn) queries. 

The query complexity of this algorithm is therefore 0{^/riX + \/rn), and it outputs all graph 
collisions. To check if there is at least one graph collision, instead of finding them all, we can replace 
finding all marked vertices using SearchAII in steps 2a, 2c and 3c, with a procedure to c heck if there 

rather than 



2.2 



is any marked vertex. Search, and this only requires 0(-^/n) queries by Corollary 
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4 Boolean Matrix Multiplication 



In this section we show how the graph cohision algorithm from the previous section can be used to 
obtain an efficient algorithm for Boolean matrix multiplication and then prove a lower bound. 

4.1 Algorithm 



What follows is a more precise statement of the high level procedure described in Section 3.2 



BMM{A,B) 

1. Let C = 0, t = n, and V = [n] 

2. While t > 1 

(a) GroverSearch(t) for an index k £ V such that the graph collision problem on k with C 
as the underlying graph has a collision. 

(b) If such a fc is found 

Compute AIIGC on the graph defined by C with oracles A[-, k] and B[k, ■] and record 
all output graph collisions in C. 
SetV - {k} andt^t-l. 

(c) Else: t ^ t/2 

3. Output C. 



Theorem 4.1. The query complexity of BOOLEAN Matrix Multiplication is 0(n\/£). 

Proof. We will analyze the complexity of the algorithm BMM{A, B). We begin by analyzing 



the cost of all the iterations in which we don't find a marked k. We have by Theorem 2.1 that 
GroverSearch(t) costs 0{\/n/t) queries to a procedure that checks if there is a collision in the graph 

defined by C with respect to and B[k,-], each of which costs (SC(n, mj), where < ^ is 

the number of Is in C at the beginning of the i^^ iteration. The cost of these steps is at most the 
following: 

/loen I 

Tl 

-&^{n,mi) ) G O 

log " / 





G O (n + \/^) y , ^ 

We now analyze the cost of all the iterations in which we do find a marked witness k. Let T be 
the number of witnesses found by BMM. Of course, T is a random variable that depends on which 
witnesses k are found, and in which order. We always have T < mm{n,i}. 
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Let ii, . . . ,iT he the indices of rounds where we find a witness. Let tj be the value of t in round 
j. Since there must be at least 1 marked element in the last round in which we find a marked 

element, wc have tij. > 1. Since wc find and eliminate at least 1 marked element in each round, we 
also have ti^^_^_^^ > ti^^_.^ + 1, which yields ti^^_^^ > j + 1 ^ ti^ >T - j + 1. 

Let \j be the number of graph collisions found on the j^^^ successful iteration, that is, the 
number of pairs witnessed by the j^^ witness, kj, that have not been recorded in C at the time we 
find kj. Then \j is also a random variable depending on which other witnesses k have been found 
already, but we always have = ^• 

Then we can upper bound the cost of all the iterations in which we do find a witness by the 
following: 

It, 

©C(n,mi,) + 6e:aii(n,mi.,Aj) ) 1 (1) 




T 



G O |^Vnr0£(n, .) + ^ ©£aii(n, . , A^) j (3) 

G O + Vri) +^ + (4) 

G O (^\/^ + n\/r + + T\/Z) (5) 

G O n min{n, + n\J minjn, ^} + V min{n, £}n£ + min{n, (6) 

G 6(n\/£) (7) 



In (4), we use the fact that nii^ < £, and in (5), we use 5^J=i < V^-^/X^j — ViT, which 
follows from the Cauchy-Schwarz inequality. □ 

4.2 Lower Bound 

Theorem 4.2. The query complexity of Boolean Matrix Multiplication is il{ny/mm{£, v? - £}) . 

Proof. We will reduce the problem of ^-Threshold, in which we must determine whether an input 
oracle / has > or < £ marked elements, to BOOLEAN Matrix Multiplication. 

Consider an instance of ^-Threshold of size v? , f : [n^] {0,1}. We can construct an 
instance of Boolean Matrix Multiplication as follows. Set A to the identity, and let B 
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encode /. Finding AB then gives the solution to the ^-Threshold instance. By fBBC+Oll . i- 
Threshold (with inputs of size n?) requires queries to solve with bounded 

error. □ 



This lower bound implies that our algorithm is tight for any £ < en^ for any constant e < 1. 
However, it is not tight for i = in? — o{n). We can search for pairs such that there is no A; G [n] 
that witnesses in cost rfi^'^. If there are m Os, we can find them all in 0(n'^/^-y/m), which is 
o{n\fi) when m E o(n). It remains open to close the gap between 0{n^/'^ yjrn) and Q.{ny/rn) when 
m G o(n^). 
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